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Method for Transforming a Digital Signal from the Time Domain 
into the Frequency Domain and Vice Versa 

Cross Reference to Related Applications 

5 This application claims the benefit of priority of U.S. 
Provisional Application no 60/507,210, filed 29 September 
2003, and U.S. Provisional Application no 60/507,440, filed 
29 September 2003, the contents of each being hereby 
incorporated by reference in its entirety for all purposes. 

10 

Further, the following commonly-owned applications are 
concurrently-filed herewith, and herein incorporated in its 
entirety: 

"Method for Performing a Domain Transformation of a 
15 Digital Signal from the Time Domain into the Frequency Domain 
and Vice Versa, " Atty. Docket No. P100442, and 

"Process and Device for Determining a Transforming 
Element for a Given Transformation Function, Method and 
Device for Transforming a Digital Signal from the Time Domain 
20 into the Frequency Domain and vice versa and Computer 
Readable Medium, " Atty. Docket No. P100452.-* 



This invention relates to a method for transforming a digital 
25 signal from the time domain into the frequency domain and 
vice- versa. 

Domain transformations, for example the discrete cosine 
transform (DCT) , are widely used in modern signal processing 
30 industry. Recently, a variant of the DCT, called integer DCT, 
has attracted a lot of research interests because of its 
important role in lossless coding applications. The term 
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"lossless" means that the decoder can generate an exact copy 
of the source signal from the encoded bit-stream* 

The DCT is a real-valued block transform. Even if the input 
block consists only of integers, the output block of the DCT 
can comprise non-integer components. For convenience, the 
input block is referred to as input vector and the output 
block as output vector. If a vector comprises only integer 
components, it is called an integer vector. In contrast to 
the DCT, the integer DCT generates an integer output vector 
from an integer input vector. For the same integer input 
vector, the integer output vector of integer DCT closely 
approximates the real output vector of DCT. Thus the integer 
DCT keeps all the good properties of the DCT in- spectrum 
analysis . 

An important property of the integer DCT is reversibility. 
Reversibility means that there exists an integer inverse DCT 
(IDCT) so that if the integer DCT generates an output vector 
Y from an input vector x, the integer IDCT can recover the 
vector x from the vector Sometimes the integer DCT is also 
referred to as the forward transform, and the integer IDCT as 
the backward or inverse transform. 

A transform called integer modified discrete cosine transform 
(IntMDCT) is recently proposed and used in the ISO/IEC MPEG-4 
audio compression. The IntMDCT can be derived from its 
prototype — the modified discrete cosine transform (MDCT) . 
The disclosure by H. S. Malvar in "Signal Processing with 
Lapped Transforms" Artech House, 1992 provides an efficient 
realization of MDCT by 'cascading- a bank of Givens rotations 
with a DCT-IV block. It is well known that Givens rotation 
can be factorised into three lifting steps for mapping 
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integers to integers. See e.g., R. Geiger, T. Sporer, J. 
Roller, K. Brandenburg, "Audio Coding based on Integer 
Transforms" AES III th Convention, New York, USA, Sept. 2001. 

5 Therefore, the realization of IntMDCT relies on an efficient 
implementation of integer DCT-IV. 

Integer transforms can be directly converted from their 
prototypes by replacing each Givens rotation with three 
lifting steps. Because in each lifting step there is one 

10 rounding operation, the total rounding number of an integer 
transform is three times the Givens rotation number of the 
prototype transform. For discrete trigonometric transforms 
(for example the Discrete Fourier Transform (DFT) or the 
Discrete Cosine Transform ( DCT ) ) , the number of Givens 

15 rotations involved is usually at Nlog 2 N level, where N is 
the size of the blocks, i.e. the amount of data symbols 
included in each block, the digital signal is divided into. 
Accordingly, the total rounding number is also at Nlog 2 N 
level for the family of directly converted integer 

20 transforms. Because of the roundings, an integer transform 
only approximates its floating-point prototype. The 
approximation error increases with the number of roundings. 

■ 

Accordingly what is needed are systems and methods for domain 
25 transforming a digital signal in a more efficient manner. 

* 

Summary of the Invention 

The present invention provides systems and methods for domain 
transforming a digital signal, whereby two blocks of input 
30 data are concurrently domain transformed in the same 
operation. This configuration reduces the number of 
effective rounding operations, and accordingly the 
approximation error. 
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In one embodiment of the invention, a method of the invention 
a method for transforming a digital signal from the time 
domain into the frequency domain and vice versa using a 
5 transformation function is presented. The transformation 
function comprises a transformation matrix, the digital 
signal comprises data symbols which are grouped into a 
plurality of blocks, each block comprising a predefined 
number of the data symbols. The method comprises 
10 transforming two blocks of the digital signal by one 

transforming element, wherein the transforming element 
corresponds to a block-diagonal matrix comprising two sub- 
matrices, wherein each sub-matrices comprises the 
transformation matrix and the transforming element comprises 

• * * 

15 a plurality of lifting stages and wherein each lifting stage 
comprises the processing of blocks of the digital signal by 
an auxiliary transformation and by a rounding unit. 

These and other features of the invention will be better 
20 understood when viewed in light of the drawings and detailed 
description of the specific embodiments. 

Brief Description of the Drawings 

Figure 1 shows the architecture of an audio encoder according 
25 to an embodiment of the invention; 

Figure 2 shows the architecture of an audio' decoder according 
to an embodiment of the invention, which corresponds to the 
audio coder shown in figure 1; 

30 

Figure 3 shows a flow chart of an embodiment of the method 
according to the invention; 
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Figure 4 illustrates an embodiment of the method according to 
the invention using DCT-IV as the transformation functions- 
Figure 5 illustrates the algorithm for the reverse 
5 transformation according to the embodiment of the method 
according to the invention illustrated in figure 4; 

Figure 6 shows the architecture for an image archiving system 
according to an embodiment of the invention; 

10 

Figure 7 shows forward and reverse transform coders used to 
evaluate the performance of the proposed system and method. 



Detailed Description of Specific Embodiments 

15 Figure 1 shows the architecture of an audio encoder 100 
according to an embodiment of the invention. The audio 
encoder 100 comprises a conventional perceptual base layer 
coder based on the modified discrete cosine transform (MDCT) 
and a lossless enhancement coder based on the integer 

20 modified discrete cosine transform (IntMDCT) . 

An audio signal 109 which, for instance, is provided by a 
microphone 110 and which is digitalized by a analog-to- 
digital converter 111 is provided to the audio encoder 100. " 
25 The audio signal 109 comprises a plurality of data symbols. 
The— audio signal 109 is divided into a plurality of blocks, 
wherein each block comprises a plurality of data symbols of 

j — j i » ^ n ^^..j « _s — .v. -i « , _ _p ^ -, i__ _ _ 

liic _l Lax o J_y iicx-l ana caon _lo Lj.aiioxuj.mcu *->y a. 

modified discrete cosine transform (MDCT) device 101. The 
30 MDCT coefficients are quantized by a quantizer 103 with the 

iiCJLp ui d ptJi.CepL.ucix iii^Litsj. iuz., 111c pciLcptua± hiOucj. Cuiiuj.uj.o 

the quantizer 103 in such a way that the audible distortions 
resulting from the quantization error are low. The quantized 
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MDCT coefficients are subsequently encoded by a bitstream 
encoder 104 which produces the lossy perceptually coded 
output bitstream 112. 

5 The bitstream encoder 104 losslessly compresses its input to 
produce an output which has a lower average bit-rate than its 
input by standard methods such as Huffman-Coding or Run- 
Length-Coding. The input audio signal 109 is also fed into an 
IntMDCT device 105 which produces IntMDCT coefficients. The 
10 quantized MDCT coefficients, which are the output of the 

quantizer 103, are used to predict the IntMDCT coefficients. 

* 

The quantized MDCT coefficients are fed into an inverse 
quantizer 106 and the output (restored or non-quantized MDCT 
coefficients) is fed into a rounding unit 107. 

15 

The rounding unit rounds to an integer value the supplied 
MDCT coefficients, and the residual IntMDCT coefficients, 
which are the difference between the integer value MDCT and 
the IntMDCT coefficients, are entropy coded by an entropy 

20 coder 108. The entropy encoder, analogous to the bitstream 
encoder 104, losslessly reduces the average bit-rate of its 
input and produces a lossless enhancement bitstream 113. The 
lossless enhancement bit stream 113 together with the 
perceptually coded bitstream 112, carries the necessary 

25 information to reconstruct the input audio signal 109 with 
minimal error. 

Figure 2 shows the architecture of an audio decoder 200 
comprising an embodiment of the invention, which corresponds 
30 to the audio coder 100 shown in figure 1. The perceptually 
coded bitstxeam 207 rs su pplied to a bitstream decoder 201, 
which performs the inverse operations to the operations of 
the bitstream encoder 104 of Figure 1, producing a decoded 



WO 2005/031596 



PCT/SG2004/000121 



7 

bitstream. The decoded bitstream is supplied to an inverse 
quantizer 202, the output of which (restored MDCT 
coefficients) is supplied to the inverse MDCT device 203. 
Thus, the reconstructed perceptually coded audio signal 209 
is obtained. 

The lossless enhancement bitstream 208 is supplied to an 
entropy decoder 204, which performs the inverse operations to 
the operations of the entropy encoder 108 of figure 1 and 
which produces the corresponding residual IntMDCT 
coefficients. The output of the inverse quantizer 202 is 
rounded by a rounding device 205 to produce integer value 
MDCT coefficients. The integer value MDCT coefficients are 
added to the residual IntMDCT coefficients, thus producing 
the IntMDCT coefficients. Finally, the inverse IntMDCT is 
applied to the IntMDCT coefficients by an inverse IntMDCT 
device 206 to produce the reconstructed losslessly coded 
audio signal 210. 

Figure 3 shows a flow chart 300 of an embodiment of the 
method according to the invention using the DCT-IV as a 
transformation and using three lifting stages, a first 
lifting stage 301, a second lifting stage 302 and a third 
lifting stage 303. This method is preferably used in the 
IntMDCT device 105 of figure 1 and the inverse IntMDCT device 
206: of figure 2 to implement IntMDCT and inverse IntMDCT, 
respectively. In Fig. 3, x x and x 2 are first and second blocks 

or~~Crre — uigiLax Sj-gnax, rco^cL-uivcx^. j-o an xiiuexmeuxa^e 

.signal., and £i and g 2 are output signals corresponding to the 
first and second block of the digital signal, respectively. 

As explained above, the DCT-IV-algorithm plays an important 
role in lossless audio coding. 
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The transformation function of the DCT-IV comprises the 

transformation matrix C™ r According to this . embodiment of 

— — — 

the invention, the transforming element corresponds to a 
block-diagonal matrix comprising two blocks, wherein each 

i 

block comprises the transformation matrix C% . 



So, in this embodiment, the transformation matrix 
corresponding to the transforming element according to the 
invention is: 



C 



C% shall in the context of this embodiment be referred to 
as the transformation matrix henceforth. 

The number of lifting matrices, and hence the number of 
lifting stages in the transformation element, in this 
embodiment of the invention, wherein DCT-IV is the 
transformation function, is three. 

The DCT-IV of an itf-point real input sequence x(n) is defined 
as follows: 



Let be the transformation matrix of OCT -XV, that is 
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The following relation holds for the inverse DCT-IV matrix 



5 {CZY=C% (3) 



In particular, the matrix C% is involutory 



With x = [jc(n)Lu„ .^_x and y - [y(m)Lw,. ' equation (1) can be 
10 expressed as 



y = c%x 



(4) 



Now, let xi, x 2 be two integer JSTxl column vectors. The 
15 column vectors x 3 , x 2 correspond to two blocks of the digital 
signal which, according to the invention, are transformed by 
one transforming element. The DCT-IV transforms of x lr x 2 are 
Yit Y?t respectively. 



20 y.-C^x, 



(5) 



(6) 



Combining (5) and (6) : 
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The above diagonal matrix is the block-diagonal matrix that 
the transforming element, according to the invention, 
corresponds to. 



5 It is within the scope of the invention if the above equation 
is changed by simple algebraic modifications like the one 
leading to 



10 



3>2 



(8) 



Let 7!^ be the counter diagonal matrix in (8) , that is 



?*2N 858 



'N 



t/V 

'N 



(9) 



15 The matrix can be factorised as follows: 



T = 
± 2N 



t/Vl 
N 



N 



-I„ c 



IV 



IV 

'N 



-C w I 



N *N 



N 



N 



>IV 



C iy I 



(10) 



20 



25 



where J N is the NxN identity matrix. 



Equation (10) can be easily verified using the DCT-IV 
property in (3). Using (10), Equation (8) can be expressed as 



y 2 



N 



-C" I 



N * N 



-I„ c 



IV lr 



N_ 
IV 



C IY I 



2. 



j u — J 



(ID 



The three lifting matrices in equation (11) correspond to the 



three lifting stages shown in Fig. 3. 
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From (11)/ the following integer DCT-IV algorithm that 
computes two integer DCT-IVs with one transforming element is 
derived. 

5 

Figure 4 illustrates the embodiment of the method according 
to the invention using DCT-IV as the transformation function. 
This embodiment is used in the audio coder 100 shown in Fig. 
1 for implementing IntMDCT. Like in Figure 3, x x and x 2 are 
10 two blocks of the input digital signal, z is an intermediate 
signal, and y.x anc * Y.2 are corresponding blocks of the output 
signal. 

The three lifting stages illustrated* in. figure 4 correspond 
15 to the three lifting matrices in equation (11) . 

As illustrated by figure 4, the time to frequency domain 
integer transform is determined by the following: 



20 In the first stage 401 f x 2 is transformed by a DCT-IV 

transformation 402 and the DCT-IV coefficients are rounded 
4 03. The rounded DCT-IV coefficients are then added to xi 
404. Thus, the intermediate signal z is generated. So, the 
intermediate signal z fulfils the equation: 



25 



g= [I X' * 2 ] + *! (12a) 



In the second stage 405/ z is transformed by a. DCT-IV 
transformation 406 and the DCT-TV coefficients are rounded 
30 407. From the rounded DCT-IV coefficients x x is then 

subtracted. Thus, the output signal is generated. So, the 
-0.u.t.pu t signal _fuXf.ils the equation : 
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In the third stage 409, ^ is.' transformed by a DCT-IV 
transformation 410 and the DCT-IV coefficients are rounded 
5 411. The rounded DCT-IV coefficients are then subtracted from 
z. Thus, the output signal y.2 is generated. So, the output 
signal %2 fulfils the equation: 



Figure 5 illustrates the algorithm for the reverse 
transformation according to an embodiment of the method 

15 according to the invention using DCT-IV as the transformation 
function. This embodiment is used in the audio decoder 200 
shown in Fig. 2 for implementing inverse IntMDCT. The 
algorithm illustrated in figure 5 is the inverse of the 
algorithm illustrated in figure 4 . The denotations for the 

20 different signals Zi,*X2, ££ir *2 and z are chosen 
corresponding to the denotations of figure 4 . 

As illustrated by Figure 5, the frequency to time domain 
integer transform is determined by the following: 

25 

In the first, stage 501, £i is transformed by a DCT-IV 
transformation 502 and the DCT-IV coefficients are rounded 

503. The rounded DCT-IV coefficients are then added to % 2 

504. Thus, the intermediate signal z is generated. So, the 
30 intermediate signal z fulfils the equation: 



(12c) 



10 



where [*J denotes rounding operation. 



(13a) 
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In the second stage 505, z is transformed by a DCT-IV 
transformation 506 and the DCT-IV coefficients are rounded 
507. From the rounded DCT-IV coefficients £i is then 
subtracted- Thus, the signal x 2 is generated- So, the signal 
x 2 fulfils the equation: 

*a = |c£>J-* < 13b > 

In the third stage 509, x 2 is transformed by a DCT-IV 
transformation 510 and the DCT-IV coefficients are rounded 
511. The rounded DCT-IV coefficients are then subtracted from 
z. Thus, the signal Xj is generated. So, the signal x 2 
fulfils the equation: 

^ = -|c^]+z (13c) 



It can be seen that the algorithm according to the equations 
(13a) to (13c) is inverse to the algorithm according to the 
equations (12a) to (12c) . Thus, if used in the encoder and 
decoder illustrated in figures 1 and 2, the algorithms 
provide a method and an apparatus for lossless audio coding. 

In an embodiment of the invention explained below, the method 
described above is used for an image archiving system. 

The equations (12a) to (12c) and (13a) to (13c) further show 
that to compute two NxN integer DCT-IVs, three NxN DCT- 
IVs v _thrjeje^JVxl_ro.uxidings, and three Nxl additions are 
needed. Therefore, for one NxN integer DCT-IV, the average 
is: 
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RC(N) = 1.5N (14) 



AC(N) = 1J5AC(C% )+1.5N (15) 



where RC(.) is the total rounding number, and AC(.) is the 
total number of arithmetic operations. Compared to the 
directly converted integer DCT-IV algorithms, the proposed 
integer DCT-IV algorithm reduces RC from level Nlog 2 N to N. 

As indicated by (15), the arithmetic complexity of the 
proposed integer DCT-IV algorithm is about 50 percent more 
than that of a DCT-IV algorithm. However, if RC is also 
considered, the combined complexity (AC+i?C) of the proposed 
algorithm does not much exceed that of the directly converted 
integer algorithms. Exact analysis of the algorithm 
complexity depends on the DCT-IV algorithm used. 

As shown in figures 4 and 5, the proposed integer DCT-IV 
algorithm is simple and modular in structure. It can use any 
existing DCT-IV algorithms in its DCT-IV computation block. 
The proposed algorithm is suitable for applications that 
require IntMDCT, e.g. in the MPEG-4 audio extension 3 
reference model 0. 

Figure 6 shows' the architecture of an image archiving system 
according to an embodiment of the invention. 

In Figure 6 an image source 601, for instance a camera, 
provides an analog image signal. The image signal is 
processed by a analog-to-digital converter 602 to provide a 
corresponding digital image signal. The digital image signal 
is losslessly encoded by a lossless image— encoder 603 which 
includes a transformation from the time domain to the 
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frequency domain. In this embodiment , the time domain 
corresponds to the coordinate space of the image. The 
lossless coded image signal is stored in a storage device 
604, for example a hard disk or a DVD. When the image is 
needed, the losslessly coded image signal is fetched from the 
storage device 604 and provided to a lossless image decoder 
605 corresponding to the lossless image encoder 603 which 
decodes the losslessly coded image signal and reconstructs 
the original image signal without any data loss. 

Such lossless archiving of image signals is important, for 
example, in the case that the images are error maps of 
semiconductor wafers and have to be stored for later 
analysis . 

In this embodiment of the invention, the embodiment of the 
method illustrated in figures 3 to 5 is used in the lossless 
image encoder 603 and the lossless image decoder 605. As 
explained above, the embodiment of the method illustrated in 
figures 3 to 5 provides a transformation which is reversible, 
thus in particular providing a method for lossless image 
coding . 

The method according to the invention is not limited to audio 
are image signals. Other digital signals, for example video 
signals, can as well be transformed by the method according 
to the invention. 

In the following, a further embodiment of the method for the 
transformation of a digital signal from the t'ime domain to 
the frequency domain and vice versa according to the 
invention is explained. 
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In this embodiment of the present invention , the domain 
transformation is a DCT transform, whereby the block size N 
is some integer. In one embodiment, N is a power of two. 



5 Let C£ be the NxN transform matrix of DCT (also called 
Type-II DCT) : 



C£ «^^^cos(in(n + V2W^)] (16) 



10 where 



JVV2 if m = 0 
[1 if m^O 



and N is the transform size, m and n are matrix indices 



Let CjJ' be the NxN transform matrix of type-IV DCT, as 
already defined above: 



C# =VW[«>s((/« + l/2X« + V2>r/^)] (18) 



As above, a a plurality of lifting matrices will be used, 
which lifting matrices are in this embodiment 2N x2N 
matrices of the following form: 



25 L 



2N 



(19) 
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• 

where I,, is the NxN identity matrix, Oj, is the NxN zero 

• 

matrix, is an arbitrary NxN matrix. 



For each lifting matrix , a lifting stage reversible 
5 integer to integer mapping is realized in the same way as the 
2x2 lifting step described in the incorporated reference 
"Factoring Wavlet Transforms into Lifting Steps," Tech. 
Report, I. Daubechies and W. Sweldens, Bell Laboratories, 
Lucent Technologies, 1996. The only difference is that 
10 rounding is applied to a vector instead of a single variable. 

In the above description of the other embodiments, it was 
already detailed how a lifting stage is realized for a 
lifting matrix, so the explanation of the lifting stages 
15 corresponding to the lifting matrices will be omitted in the 
following. 



One sees that the transposition of , L* 2N is also a 
lifting -matrix. 

In this embodiment, the transforming element corresponds to a 
matrix, which is defined as a 2Nx2N matrix in the 

following way: 



T = 



O 

o c w 



(20) 



The decomposition of the matrix into lifting matrices 

has the following form: 



=E3-L8-L7-L6-P2-L5-L4'L3-L2-L1-P1 (21) 



30 
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The matrices constituting the right hand side of the above 
equation will be explained in the following. 

PI is a first permutation matrix given by the equation 



Pl = 



J* o N 



(22) 



where J N is the NxN counter index matrix given by 



10 



0 
0 
0 

1 



0 
0 

0 



0 1 

1 0 
0 0 
0 0 



(23) 



and D N is a NxN diagonal matrix with diagonal element 
being 1 and -1 alternatively: 



15 



D 



N 



1 


0 


0 


0 


0' 


0 


-1 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


• 

• 

• 


0 


0 


0 


0 


0 


-1 



(24) 



P2 is a second permutation matrix, an example of which is 
generated by the following MATLAB script: 



Pd = eye(2*N) ; 
for i = 2:2:N, 

Pd(i,i) = 0; Pd(N+i,N+i) = 0; 

Pd(i,N+i) = 1; Pd(N+i,i) = 1; 

25 end 
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Peo = zeros (2*N); 
for i = 1:N, 

Peo(i,2*i-l) = 1; 

Peo(i+N,2*i) = 1; 

end 

P2 = (Pd*Peo) 1 ; 



As an example, when J\T is 4, P2 is a 8x8 matrix given as 



1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


1 


0 


0 


0 


0 



for N~4 



(25) 



P3 is a third permutation matrix, an example of which is 
generated, by the following MAT LAB script: 



P3 = zeros (2*N) ; 
for i = 1:N, 

P3(i,2*i-1) = 1; 

P3(N2-i+l,2*i) = 1; 

CliU 



As an example, when N is 4, P3 is a 8x8matrix given as 
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P3 = 



1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 



for N = 4 (26) 



LI is a first lifting matrix 

I 



Ll = 



O 



.w 



(27) 



where Zl^ is a iVxiV counter diagonal matrix given as 



Zl„ = 



0 
0 
0 



0 
0 



-tan((ZAT-l>r/8iV) 0 



0 

tan(3^r/8i^) 
0 
0 



- tan(^/8JNr)" 
0 
0 
0 



(28) 



10 L2 is a second lifting matrix 



L2 = 



O 



JV 



(29) 



where Z2 N is a NxN counter diagonal matrix given as: 



15 



Z2„ = 



0 
0 
0 



0 
0 



sin{3n/4N) 



oin^TT /A AT" I 



0 

* 

0 

0 



sw{{2N-lpc/4N) 
0 
0 
0 



(30) 



L3 is a third lifting matrix: 
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L3 



Z3 A I N 



(31) 



where 



Z3 N - J2C% +l N + Z1 N 



(32) 



L4 is a fourth lifting matrix 



10 



L4 



(33) 



where : 



Z4„=Cjj7V2 



(34) 



15 L5 is a fifth lifting matrix: 



L5 



Z5n iff 



(35) 



where: 



20 



Z5„-4/2Cjr+lJ 



(36) 



L6 is a sixth lifting matrix: 



25 



L6 = 



Z6^ Ij, 



(37) 



where Z6 N is a NxN counter diagonal matrix given as: 
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Z6„ = 



0 0 

0 0 
0 

tan(tf/8) 0 



0 

tan(7r/8) 
0 
0 



tan(tf/8) 
0 
0 
0 



(38) 



L7 is a seventh liftiiig matrix: 



L7 = 



I„ Z7 



N ""If 



o 



N *N 



(39) 



where Z7 N is a NxN counter diagonal matrix given as 



10 



Z7„ = 



0 
0 
0 

sin(w/4) 



0 0 
0 

sin (w/4) 0 

0 0 



-sin(/r/4) 
0 
0 
0 



(40) 



15 



L8 is an eighth lifting matrix: 



L8 = L6 



(41) 



thus, resulting in the factorization as shown in (42) : 



=P3-L8-L7-L6-P2-L5-L4-L3-L2-L1-P1 



(42) 



20 where PI, P2, and P3 are three permutation matrices. L; , ; 
from 1 to 8, are eight lifting matrices. 



The lifting matrices L3, L4 and L5 comprise an auxiliary 
transformation matrix, which is, in this case, the 

25 transformation matrix itself. 
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From Eq. (42) , it is possible to compute the integer DCT for 
two input signals of dimension Nxl. 

As Eq- (42) provides a lifting matrix factorization which 
describes the DCT-IV transformation domain, its lifting 
matrices can be used in the manner shown herein to compute 
the domain transformation of an applied input signal. 

The equation (42) can be derived in the following way. 

The following decomposition can be derived using the 
disclosure from Wang, Zhongde, "On Computing the Discrete 
Fourier and Cosine Transforms", IEEE Transactions on 
Acoustics, Speech and Singal Processing, Vol.. ASSP-33, No. 4 
15 October 1985: 



10 



xg^n 



'N/2 



&N/2 



l N 



(43) 



mil 



'N/2 



'N/2 



P -T 

r DJ X N 



is known, wherein S** 2 denotes the transformation matrix of 
20 the discrete sine transform of type 2, 



ri 



T>_ T 



25 



P N is a NxN permutation matrix given by 



p 

* N 



l N/2 



(44) 



N/2 
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% = 



COS 



4N 



cos 



3n 
4N 



- sin 



33t 

4N 



- sin 



or 



4N 



cos 



- sin 



(n - l) n 
4N 

(N - l) 31 



sm 



cos 



4N 



(N - l) « 
4N 

(n - l) n 

4N 



sin 



sin 



3jc 
4N 



cos 



3k 
4N 



JC 



4N 



cos 



JC 



4N 



and 



B N 



42 



41 



i -i 
i i 



i -l 
i i 



42 



Equation (85) can be combined with the equation 



10 Cjjf =Rpo 



'N/2 



Vtf/2 



wherein Peo is an even-odd permutation matrix, 



(45) 



1 


In/2 


V2 


1 N/2 

m 



N/2 



-/ 



N/2 
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Rpo equals T N , 



2 



2 



After transposition equation (45) converts to 



V2 



^N/2 



f*IV 



^N/2 



'N/2 



(46) 



The combination of (43) and (46) yields: 



^N/2 



'N/2 



P B 

*N °N 



V2 



V/V/2 



^N/2 



^72 



Vtf/2 
Vtf/2 



(47) 



R, P, 



where : 



p 2 =(p £0 r-(Pz>r=(p D -p £0 ) r 



P 3 -TV 



R 2 =B^ 



from (47), equation (42) can be easily derived. 
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In this embodiment , the computation of the domain 
transformation requires only 4N rounding operations, as will 
now be explained: 

5 Let oc(*) be the number of real additions, jz(*) be the number 
of real multiplications, and be the number of real 

roundings, respectively. For the proposed IntDCT algorithm, 
one gets: 

a (IntDCT) = M + 3a(DCT - IV) 
10 ^(IntDCT) =9N + 3fi(DCT - IV) 

7 (IntDCT) = 87^ 



The above results are for two blocks of data samples, because 
the proposed IntDCT algorithm processes them together. Thus 
15 for one block of data samples, the numbers of calculations 
are halved, which are 

a x (IntDCT) - 5 jSN + 1 5a (DCT - IV) 
ft (IntDCT) = 45N +1.5^ (DCT - IV) 

20 ^(intDCT)-^ 

where a lf ft, and y x are the number of real additions, number 
of real multiplications, and number of real roundings, for 
one block of samples, respectively. 



25 For DCT-IV calculation, the FFT-based algorithm described in 
the incorporated reference "Signal Processing with lapped 
Transforms," H. S. Malvar, Norwood, MA. Artech House, 1992, 
pp. 199-201 can be used, for which 



30 



a (DCT -IV) - LSNlo^N 
ju(DCT - IV) = 0.5IUog 2 N + N 
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Consequently: 

^(intDCT) = 2.25^1(^2 N + 55N 
/i, (intDCT) - 0.75^ log 2 N + 6N 

In the following, a further embodiment of the method for the 
transformation of a digital signal from the time domain to 
the frequency domain and vice versa according to the 
invention is explained. 

In this embodiment a discrete fast fourier transform (FFT) is 
used as the domain transformation* 

Let' F be the N * N transform matrix of the normalized FFT 

m,n~0X- -,N-1 (48) 



where N is the transform size, m and n are matrix indices. 

Under this embodiment, a permutation matrix P of dimension 
NxN i s a matrix which includes indices 0 or 1. After 
multiplying it with a vector (the matrix representation 

of the input signal) , the order of elements in the vector are 
changed- 

In this embodiment, lifting matrices are defined as 2Wx2W 
matrices of the following form: 
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Pi A 
O P, 



(49) 



10 



where ^ and ^ 2 are two permutation matrices, Ois the N xN 

zero matrix, A ± s an arbitrary NxN matrix. For lifting 

matrix reversible integer to integer mapping is realized 

in the same way as the 2x2 lifting step in the 
aforementioned incorporated reference of I. Daubechies. As 
above, however, rounding is applied to a vector instead of a 
single variable. It is apparent that the transposition of 

L f L r is also a lifting matrix. 



15 



Further, let T be a 2Nx2N transform matrix 



ro f 

F O 



(50) 



20 



Accordingly, the modified transform matrix T (and 
accordingly the domain transformation itself) can be 
expressed as the lifting matrix factorization: 



I 


O 


• 


-Q 


Fl 


I O 


-Q F 


I 




O 


I 


F I 



(51) 



where I is the NxN identity matrix, and Q is a NxN 
permutation matrix given as: 



Q = 



O 



O 



UN-1 
J 



(52) 
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and ^w- 1 and are row and column vectors of #-1 zeros 

respectively. 

J is the (W-l) x 0^-l) counter index matrix given by 



(53) 



In Eq. (53) , blank space in the square bracket represents all 
10 zeros matrix elements. 

A can be seen from Eq. (51) , the lifting matrix factorization 

can be use to compute the integer FFT for two complex 
vectors using the methods as described herein. 

15 

Under this embodiment, the computation of the domain 
transformation requires only 3N rounding operations, as will 
now be explained: 

20 Let: a (*) be the number of real additions, 

be the number of real multiplications, and 

> be the number of real rounding operations, 
respectively . 

Z5 For the proposed IntTFFT algorithm, we have 



a(lntHFT) = 6JV + 3a(FFT) 
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|i(lntFFT)=3/i(FFT) 
y(lntFFT)=6iV r 

The above results are for two blocks of data samples , because 
the proposed IntFFT algorithm processes them together. Thus 
for one block of data samples, the numbers of calculations 
are halved, which are 

a x (IntFFT) = 3N + 1.5a (FFT) 

/i 1 (lntFFT) = 1.5/i(FFT) 

Y 1 {MFFr) = 3N 

where a *, ^ , and ^ are the number of real additions, number 
of real multiplications, and number of real rounding 
operations for one block of samples, respectively. 

V 

For FFT calculation, the split-radix FFT (SRFFT) algorithm 
can be used, for which: 

a(SRFFT) = 3JNTlog 2 JV-3JV + 4 

|i(SRFFT) - N log 2 JV-3JV + 4 

Consequently, we have: 

a a (lntFFT) - 4.5J\riog 2 N-15N + 6 

/i 1 (lntFFT) = l^JV r log 2 N-4SN + 6 

Figure 7 shows forward and reverse transform coders used to 
assess the transform accuracy of the DCT transformation 
technique described above and the FFT domain transformation 
above. The test involved measuring the mean squared error 
(MSE) of the transform in accordance with the evaluation 
standards proposed by the MPEG- 4 lossless audio coding group 
as described in "Coding of Moving Pictures and Audio: Work 
plan for Evaluation of Integer MDCT for FGS to Lossless 
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Experimentation Framework, " ISO/IEC JTC 1/SC 29/WG 11 N5578 
pattaya, Thailand, Mar. 2003 incorporated herein. 

Specifically, the MSEs for IntDCT and integer inverse DCT 
(IntlDCT) are given as: 



-I K-l i N-l 



(54) 



where the error signal e is e f for IntDCT, and e ' for IntlDCT 
as in Fig 1. K is the total number of sample blocks used in 
the evaluation. 

The MSEs for IntFFT and integer inverse FFT (IntlFFT) are 
given as 



•I JC-l 1 AT-l 

KfaNU (55) 



where the error signal e is e ' for IntFFT, and e * for IntlFFT 

as in Fig 1. «^ represents norm of a complex value. K is the 
total nximber of sample blocks used in the evaluation. 

■ 

For both domain transformations, a total of 450 seconds with 
15 different types of music files are used in the 48 kHz/16- 
bit test set. Table I shows the test results. 

As can be seen from Table 1, the MSE generated • using the 
-systems and methods of the present invention is very minimal, 
and unlike conventional— sy-stems., .is substantially independent 
of the processing block size. Referring to the DCT-IV domain 
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transformation, the MSE only slightly increases with 
increasing block size N up to 4096 bits. The MSEs of the FFT 
are even better, exhibiting a constant MSE of 0 . 4 for block 
sizes up to 4096 bits. When the demonstrated performance of 
the present invention is viewed in light of the present 
capabilities and increasing need for longer block sizes, the 
advantages of the present invention become clear. 



10 



15 



In tDCT- IV 
0.537 



In tlDCT- IV 
0.537 




IntFFT 
0.456 



Table I 



I ntlFFT 
0.371 
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